Filtered poisson process bandit on a continuum

نویسندگان

چکیده

We consider a version of the continuum armed bandit where an action induces filtered realisation non-homogeneous Poisson process. Point data in sample are then revealed to decision-maker, whose reward is total number points. Using knowledge function governing filtering, but without intensity function, decision-maker seeks maximise expected points over T rounds. propose upper confidence bound algorithm for this problem utilising data-adaptive discretisation space. This approach enjoys O˜(T2/3) regret under Lipschitz assumption on function. provide lower bounds any problem, via new related finite-armed bandits, and show that orders match up logarithmic factor.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fractional Poisson Process

For almost two centuries, Poisson process with memoryless property of corresponding exponential distribution served as the simplest, and yet one of the most important stochastic models. On the other hand, there are many processes that exhibit long memory (e.g., network traffic and other complex systems). It would be useful if one could generalize the standard Poisson process to include these p...

متن کامل

Navigation on a Poisson Point Process

On a locally finite point set, a navigation defines a path through the point set from a point to another. The set of paths leading to a given point defines a tree, the navigation tree. In this article, we analyze the properties of the navigation tree when the point set is a Poisson point process on R. We examine the local weak convergence of the navigation tree, the asymptotic average of a func...

متن کامل

Contextual Gaussian Process Bandit Optimization

How should we design experiments to maximize performance of a complexsystem, taking into account uncontrollable environmental conditions? Howshould we select relevant documents (ads) to display, given information about theuser? These tasks can be formalized as contextual bandit problems, where at eachround, we receive context (about the experimental conditions, the query), and<l...

متن کامل

the impact of portfolio assessment on iranian efl students essay writing: a process-oriented approach

this study was conducted to investigate the impact of portfolio assessment as a process-oriented assessment mechanism on iranian efl students’ english writing and its subskills of focus, elaboration, organization, conventions, and vocabulary. out of ninety juniors majoring in english literature and translation at the university of isfahan, sixty one of them who were at the same level of writing...

15 صفحه اول

Fuzzy Random Homogeneous Poisson Process and Compound Poisson Process

By dealing with interarrival times as exponentially distributed fuzzy random variables, a fuzzy random homogeneous Poisson process and a fuzzy random compound Poisson process are respectively defined. Several theorems on the two processes are provided, respectively.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: European Journal of Operational Research

سال: 2021

ISSN: ['1872-6860', '0377-2217']

DOI: https://doi.org/10.1016/j.ejor.2021.03.033